NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A physics-informed impact model refined by multi-fidelity transfer learning

https://doi.org/10.1016/j.eml.2024.102223

Snapp, Kelsey L; Silverman, Samuel; Pang, Richard; Tiano, Thomas M; Lawton, Timothy J; Whiting, Emily; Brown, Keith A (November 2024, Extreme Mechanics Letters)

Full Text Available
Amortized Noisy Channel Neural Machine Translation

Pang, Richard Yuanzhe; He, He; Cho, Kyunghyun (July 2022, INLG 2022)

Noisy channel models have been especially effective in neural machine translation (NMT). However, recent approaches like "beam search and rerank" (BSR) incur significant computation overhead during inference, making real-world application infeasible. We aim to study if it is possible to build an amortized noisy channel NMT model such that when we do greedy decoding during inference, the translation accuracy matches that of BSR in terms of reward (based on the source-to-target log probability and the target-to-source log probability) and quality (based on BLEU and BLEURT). We attempt three approaches to train the new model: knowledge distillation, one-step-deviation imitation learning, and Q learning. The first approach obtains the noisy channel signal from a pseudo-corpus, and the latter two approaches aim to optimize toward a noisy-channel MT reward directly. For all three approaches, the generated translations fail to achieve rewards comparable to BSR, but the translation quality approximated by BLEU and BLEURT is similar to the quality of BSR-produced translations. Additionally, all three approaches speed up inference by 1-2 orders of magnitude.
more » « less
Full Text Available
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way

https://doi.org/10.18653/v1/2022.emnlp-main.75

Wang, Alex; Yuanzhe Pang, Richard; Chen, Angelica; Phang, Jason; Bowman, Samuel R. (May 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Summarization datasets are often assembled either by scraping naturally occurring public-domain summaries -- which are nearly always in difficult-to-work-with technical domains -- or by using approximate heuristics to extract them from everyday text -- which frequently yields unfaithful summaries. In this work, we turn to a slower but more straightforward approach to developing summarization benchmark data: We hire highly-qualified contractors to read stories and write original summaries from scratch. To amortize reading time, we collect five summaries per document, with the first giving an overview and the subsequent four addressing specific questions. We use this protocol to collect SQuALITY, a dataset of question-focused summaries built on the same public-domain short stories as the multiple-choice dataset QuALITY (Pang et al., 2021). Experiments with state-of-the-art summarization systems show that our dataset is challenging and that existing automatic evaluation metrics are weak indicators of quality.
more » « less
Full Text Available
What Do NLP Researchers Believe? Results of the NLP Community Metasurvey

https://doi.org/10.18653/v1/2023.acl-long.903

Michael, Julian; Holtzman, Ari; Parrish, Alicia; Mueller, Aaron; Wang, Alex; Chen, Angelica; Madaan, Divyam; Nangia, Nikita; Pang, Richard Yuanzhe; Phang, Jason; et al (January 2023, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers))

Full Text Available
Designing lattices for impact protection using transfer learning

https://doi.org/10.1016/j.matt.2022.06.051

Gongora, Aldair E.; Snapp, Kelsey L.; Pang, Richard; Tiano, Thomas M.; Reyes, Kristofer G.; Whiting, Emily; Lawton, Timothy J.; Morgan, Elise F.; Brown, Keith A. (September 2022, Matter)

Full Text Available
QuALITY: Question Answering with Long Input Texts, Yes!

Bowman, Samuel R.; Chen, Angelica; He, He; Joshi, Nitish; Ma, Johnny; Nangia, Nikita; Padmakumar, Vishakh; Pang, Richard Yuanzhe; Parrish, Alicia; Phang, Jason; et al (May 2022, NAACL 2022)

To enable building and testing models on long-document comprehension, we introduce QuALITY, a multiple-choice QA dataset with context passages in English that have an average length of about 5,000 tokens, much longer than typical current models can process. Unlike in prior work with passages, our questions are written and validated by contributors who have read the entire passage, rather than relying on summaries or excerpts. In addition, only half of the questions are answerable by annotators working under tight time constraints, indicating that skimming and simple search are not enough to consistently perform well. Our baseline models perform poorly on this task (55.4%) and significantly lag behind human performance (93.5%).
more » « less
Full Text Available
QuALITY: Question Answering with Long Input Texts, Yes!

https://doi.org/10.18653/v1/2022.naacl-main.391

Pang, Richard Yuanzhe; Parrish, Alicia; Joshi, Nitish; Nangia, Nikita; Phang, Jason; Chen, Angelica; Padmakumar, Vishakh; Ma, Johnny; Thompson, Jana; He, He; et al (January 2022, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Full Text Available
Comparing Test Sets with Item Response Theory

Vania, Clara; Htut, Phu Mon; Huang, William; Mungra, Dhara; Yuanzhe Pang, Richard; Phang, Jason; Liu, Haokun; Cho, Kyunghyun; Bowman, Samuel R. (June 2021, Annual Meeting of the Association for Computational Linguistics)
null (Ed.)
Full Text Available

Search for: All records